Live freelance tracking. Raw descriptions turned into structured data. Find your next tech project without the noise.
upwork.com 🟡 2026-05-02
🔹 Web Scraping for Online Catalog
👤 Client: 🇨🇿 Czech Republic Member since 2014-04-29
💰 Price: $30
🚩 Problem: Scraping a large online catalog with Cloudflare protection within 24 hours.
📦 Existing: [URL]
Specifications:
[Target] - Extract data from 70,000 subpages of an online catalog.
[Method] - Use headless browsers and proxy rotation to bypass Cloudflare.
[UI/UX] - Not applicable.
[Stack] - Python with Scrapy or BeautifulSoup, Selenium for dynamic content, Proxy services like Scrapingant or ScraperDo.
[Security] - Ensure data is handled securely; use encrypted connections and secure storage.
[Format] - Output CSV tables.
Workflow:
1. Set up a headless browser environment with Selenium to handle dynamic content.
2. Implement proxy rotation using services like Scrapingant or ScraperDo to bypass Cloudflare.
3. Write scraping scripts to navigate through pagination and extract URLs for entry details.
4. Develop scripts to scrape data from the 68,000 detail pages.
5. Validate and clean scraped data before exporting to CSV.